PARAMETRIX TECHNIQUES AND MARTINGALE PROBLEM FOR SOME 
DEGENERATE KOLMOGOROV'S EQUATIONS 

S. Menozzi 1 

Abstract. We prove the uniqueness of the martingale problem associated to some degenerate operators.
The key point is to exploit the strong parallel between the new technique introduced by Bass and
Perkins [BP09] to prove uniqueness of the martingale problem in the framework of non-degenerate
elliptic operators and the McKean and Singer [MS67] parametrix approach to the density expansion,
which has previously been extended to the degenerate setting that we consider (see Delarue and
Menozzi [DM10]).
1. Introduction 

1.1. Martingale problem and parametrix techniques 

The martingale approach turns out to be particularly useful when trying to get uniqueness results for the 
stochastic process corresponding to an operator. In a recent work, R. Bass and E. Perkins [BP09] introduced, 
in the framework of non-degenerate, non-divergence, time-homogeneous operators, a new technique to prove 
uniqueness of the associated martingale problem. Precisely, for an operator of the form 

$$Lf(x) = \frac{1}{2}\mathrm{Tr}\big(a(x)D^2 f(x)\big), \quad f \in C_0^2(\mathbb{R}^d,\mathbb{R}),\ x \in \mathbb{R}^d, \tag{1.1}$$

the authors prove uniqueness provided $a$ is uniformly elliptic, bounded and uniformly $\eta$-Hölder continuous in 
space ($\eta \in (0,1]$), i.e. there exists $C > 0$ s.t. for all $(x,y) \in (\mathbb{R}^d)^2$, $|a(x) - a(y)| \le C(1 \wedge |x-y|^\eta)$. That is, 
for a given starting point $x \in \mathbb{R}^d$, there exists a unique probability measure $\mathbb{P}$ on $C(\mathbb{R}^+,\mathbb{R}^d)$ s.t., denoting by 
$(X_t)_{t \ge 0}$ the canonical process, $\mathbb{P}[X_0 = x] = 1$ and for every $f \in C_0^2(\mathbb{R}^d,\mathbb{R})$, $f(X_t) - f(x) - \int_0^t Lf(X_s)ds$ is a 
$\mathbb{P}$-martingale. 

In the indicated framework, this result can be derived from the more involved Calderón-Zygmund like $L^p$ 
estimates established by Stroock and Varadhan [SV79], which only require continuity of the diffusion matrix $a$, 
or, from a more analytical viewpoint, from some appropriate Schauder estimates, see e.g. Friedman [Fri64]. 

Anyhow, the technique introduced in [BP09] can be related to the first step of Gaussian approximation of 
the parametrix expansion of the fundamental solution of (1.1) developed by McKean and Singer [MS67] that we 
now shortly describe. Suppose first that, additionally to the previous assumptions of ellipticity, boundedness 
and uniform Hölder continuity, the diffusion coefficient $a$ is smooth (say $C^\infty(\mathbb{R}^d,\mathbb{R})$). Then the fundamental 
solution $p(s,t,x,y)$ of (1.1) exists and is smooth for $t > s$, see e.g. [Fri64]. Precisely, we have: 

$$\partial_t p(s,t,x,y) = L^* p(s,t,x,y),\ t > s,\ (x,y) \in (\mathbb{R}^d)^2, \qquad p(s,t,x,\cdot) \underset{t \downarrow s}{\longrightarrow} \delta_x(\cdot),$$

where $L^*$ stands for the adjoint of $L$ and acts on the $y$ variable. For fixed starting and final points $x, y \in \mathbb{R}^d$ 
and a given final time $t > 0$, in order to estimate $p(0,t,x,y)$, one introduces the Gaussian process $\tilde X_u^y = 
x + \sigma(y)W_{u-s}$, $u \in [s,t]$, $s < t$, where $(W_u)_{u \in [0,t-s]}$ is a standard $d$-dimensional Brownian motion and $\sigma\sigma^*(y) = 
a(y)$. Observe that the coefficient of $\tilde X^y$ is frozen here at the point where we consider the density. Denote 
by $\tilde p^y(s,t,x,\cdot)$ the density of $\tilde X^y$ at time $t$ starting from $x$ at time $s$, and for $\varphi \in C_0^2(\mathbb{R}^d,\mathbb{R})$, define by 
$\tilde L^y\varphi(x) = \frac{1}{2}\mathrm{Tr}(a(y)D_x^2\varphi(x))$ its generator. The density $\tilde p^y(s,t,x,\cdot)$ satisfies the Kolmogorov equation: 

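$$\partial_s \tilde p^y(s,t,x,z) = -\tilde L^y \tilde p^y(s,t,x,z),\ s < t,\ (x,z) \in (\mathbb{R}^d)^2, \qquad \tilde p^y(s,t,\cdot,z) \underset{s \uparrow t}{\longrightarrow} \delta_z(\cdot),$$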

where $\tilde L^y$ acts here on the $x$ variable. Take now $z = y$ in the above equation. By formal derivation and the 
previous Kolmogorov equations we obtain: 

$$\begin{aligned}
p(0,t,x,y) - \tilde p^y(0,t,x,y) &= \int_0^t ds\, \partial_s \int_{\mathbb{R}^d} p(0,s,x,w)\,\tilde p^y(s,t,w,y)\,dw \\
&= \int_0^t ds \int_{\mathbb{R}^d} \Big( L^*p(0,s,x,w)\,\tilde p^y(s,t,w,y) - p(0,s,x,w)\,\tilde L^y\tilde p^y(s,t,w,y) \Big) dw \\
&= \int_0^t ds \int_{\mathbb{R}^d} p(0,s,x,w)\,(L - \tilde L^y)\tilde p^y(s,t,w,y)\,dw \\
&= \int_0^t ds \int_{\mathbb{R}^d} p(0,s,x,w)\,H(s,t,w,y)\,dw := p \otimes H(0,t,x,y),
\end{aligned} \tag{1.2}$$

where $\otimes$ denotes a time-space convolution. Observe that $H(s,t,w,y) := \frac{1}{2}\mathrm{Tr}\big((a(w) - a(y))D_w^2\tilde p^y(s,t,w,y)\big)$. From 
direct computations, there exist $c, C > 0$ (depending on $d$, the uniform ellipticity constant and the $L^\infty$ bound 
of $a$) s.t. $|D_w^2\tilde p^y(s,t,w,y)| \le \frac{C}{(t-s)^{d/2+1}}\exp\big(-c\frac{|w-y|^2}{t-s}\big)$. The previous uniform Hölder continuity assumption 
on $a$ is therefore a sufficient (and quite sharp) condition to remove the time-singularity in $H$. The idea of 
the parametrix expansion is then to proceed in (1.2) by applying the same freezing technique to $p(0,s,x,w)$, 
introducing the density $\tilde p^w(0,s,x,\cdot)$ of the process with coefficients frozen at point $w$. One eventually gets the 
formal expansion 

$$p(0,t,x,y) = \tilde p(0,t,x,y) + \sum_{k \ge 1} \tilde p \otimes H^{\otimes k}(0,t,x,y), \tag{1.3}$$

where $H^{\otimes k}$, $k \ge 1$, stands for the iterated convolutions of $H$, and $\forall (s,t,z,y) \in (\mathbb{R}^+)^2 \times (\mathbb{R}^d)^2$, $\tilde p(s,t,z,y) := 
\tilde p^y(s,t,z,y)$. The Hölder continuity gives that $H$ is a "smoothing" kernel in the sense that there exist $c, C > 0$ 
(with the same previous dependence) s.t. 

$$|\tilde p \otimes H^{\otimes k}(s,t,z,y)| \le C^{k+1}(t-s)^{k\eta/2}\prod_{i=1}^{k} B\Big(1 + \frac{(i-1)\eta}{2}, \frac{\eta}{2}\Big)\,(t-s)^{-d/2}\exp\Big(-c\frac{|y-z|^2}{t-s}\Big),$$

where $B(m,n) = \int_0^1 s^{m-1}(1-s)^{n-1}ds$ stands for the $\beta$ function. From this estimate, 
equation (1.3) and the asymptotics of the $\beta$ function, one directly gets the Gaussian upper bound over a 
compact time interval. Namely, for all $T > 0$, there exist constants $c, C > 0$ s.t. 

$$\forall\, 0 \le s < t \le T,\ \forall (x,y) \in (\mathbb{R}^d)^2,\quad p(s,t,x,y) \le \frac{C}{(t-s)^{d/2}}\exp\Big(-c\frac{|y-x|^2}{t-s}\Big), \tag{1.4}$$

with $c, C$ depending on $d$, the uniform ellipticity constant and bound of $a$, and $C$ depending on $T$ as well. 
We refer to Konakov and Mammen [KM00] for details in this framework. 
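
The convergence of the series in (1.3) rests on the decay of the products of $\beta$ functions. The following minimal numerical sketch is not part of the original argument: it assumes that the factors are $B(1 + (i-1)\eta/2, \eta/2)$, which is what iterating the time convolution of $(t-u)^{-1+\eta/2}$ produces, so that the product telescopes to $\Gamma(\eta/2)^k/\Gamma(1 + k\eta/2)$. The function names and the sample values of $C$, $T$, $\eta$ are illustrative assumptions.

```python
import math


def beta(m, n):
    """Euler beta function B(m, n), expressed through the gamma function."""
    return math.gamma(m) * math.gamma(n) / math.gamma(m + n)


def beta_product(k, eta):
    """Product over i = 1..k of B(1 + (i-1)*eta/2, eta/2) (assumed form of the factors)."""
    prod = 1.0
    for i in range(1, k + 1):
        prod *= beta(1.0 + (i - 1) * eta / 2.0, eta / 2.0)
    return prod


def telescoped(k, eta):
    """Closed form Gamma(eta/2)^k / Gamma(1 + k*eta/2) obtained by telescoping."""
    return math.gamma(eta / 2.0) ** k / math.gamma(1.0 + k * eta / 2.0)


def series_terms(C, T, eta, kmax):
    """Time-dependent factors C^(k+1) * T^(k*eta/2) * beta_product(k, eta), k = 1..kmax."""
    return [C ** (k + 1) * T ** (k * eta / 2.0) * beta_product(k, eta)
            for k in range(1, kmax + 1)]


if __name__ == "__main__":
    eta, C, T = 1.0, 1.0, 1.0  # hypothetical sample values
    terms = series_terms(C, T, eta, 60)
    print("first term  :", terms[0])
    print("term k = 60 :", terms[-1])
    print("partial sum :", sum(terms))
```

The factorial-type growth of $\Gamma(1 + k\eta/2)$ eventually dominates any geometric factor $(C\,\Gamma(\eta/2)\,T^{\eta/2})^k$, which is why the bound on the whole series stays finite on a compact time interval.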

Up to now we supposed $a$ was smooth in order to guarantee the existence of the density and justify the formal 
derivation in (1.2). On the other hand, the r.h.s. of (1.3) can be defined without additional smoothness on $a$ 
than uniform $\eta$-Hölder continuity. The Gaussian upper bound (1.4) also only depends on the Hölder regularity 
of $a$. A natural question is to know whether the r.h.s. of (1.3) corresponds to the density of some stochastic 
differential equation under the sole assumptions of uniform ellipticity, boundedness and Hölder continuity on $a$. 
A positive answer is given by the uniqueness of the martingale problem associated to (1.1). Indeed, considering 
a sequence of equations with mollified coefficients, we derive from convergence in law, the Radon-Nikodym 
theorem and (1.4) that the unique weak solution of $dX_t = \sigma(X_t)dW_t$ associated to $L$ admits a density that 
satisfies the previous Gaussian bound. It is actually remarkable that the uniqueness of the martingale problem 
can be proved using exactly the smoothing properties of the previous kernel $H$. That is what was achieved by 
Bass and Perkins [BP09] in the framework we described and it is the main purpose of this note in a degenerate 
setting. 

To conclude this paragraph, let us emphasize that the previous parametrix approach has been used in various 
contexts. It turns out to be particularly well suited to the approximation of the underlying processes by Markov 
chains, see Konakov and Mammen [KM00], [KM02] for the non-degenerate continuous case or [KM10] for the 
approximation of stable driven SDEs. On the other hand, we recently used this technique to give a local limit 
theorem for the Markov chain approximation of a Langevin process [KMM10] and two-sided bounds for some 
more general degenerate hypoelliptic operators [DM10]. In particular, in both works, we have an unbounded 
drift term. The unboundedness of the first order term imposes a more subtle strategy than the previous one for 
the choice of the frozen Gaussian density. Namely, one has to take into consideration in the frozen process the 
"geometry" of the deterministic differential equation associated to the first order terms of the operator. This 
will be thoroughly explained in the next section. Anyhow, the strategy of the previous articles allows us to extend 
the technique of Bass and Perkins to prove uniqueness of the martingale problem for some degenerate operators 
with unbounded coefficients. 

1.2. Statement of the Problem and Main Results 

Consider the following system of Stochastic Differential Equations (SDEs in short) 



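$$\begin{aligned}
dX_t^1 &= F_1(t, X_t^1, \dots, X_t^n)dt + \sigma(t, X_t^1, \dots, X_t^n)dW_t, \\
dX_t^2 &= F_2(t, X_t^1, \dots, X_t^n)dt, \\
dX_t^3 &= F_3(t, X_t^2, \dots, X_t^n)dt, \\
&\ \ \vdots \\
dX_t^n &= F_n(t, X_t^{n-1}, X_t^n)dt,
\end{aligned} \qquad t \ge 0, \tag{1.5}$$

$(W_t)_{t \ge 0}$ standing for a $d$-dimensional Brownian motion, and each $(X_t^i)_{t \ge 0}$, $1 \le i \le n$, being $\mathbb{R}^d$-valued as well. 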

From the applicative viewpoint, systems of type (1.5) appear in many fields. Let us for instance mention for 
$n = 2$ stochastic Hamiltonian systems (see e.g. Soize [Soi94] for a general overview or Talay [Tal02] and Hérau 
and Nier [HN04] for convergence to equilibrium). Again for $n = 2$, the above dynamics is used in mathematical 
finance to price Asian options (see for example [BPV01]). For $n \ge 2$, it appears in heat conduction models 
(see e.g. Eckmann et al. [EPRB99] and Rey-Bellet and Thomas [RBT00] when the chain is forced by two heat 
baths). 

In what follows, we denote a quantity in $\mathbb{R}^{nd}$ by a bold letter: i.e. $\mathbf{0}$ stands for zero in $\mathbb{R}^{nd}$ and the solution 
$(X_t^1, \dots, X_t^n)_{t \ge 0}$ to (1.5) is denoted by $(\mathbf{X}_t)_{t \ge 0}$. Introducing the embedding matrix $B$ from $\mathbb{R}^d$ into $\mathbb{R}^{nd}$, i.e. 
$B = (I_d, 0, \dots, 0)^*$, where "$*$" stands for the transpose, we rewrite (1.5) in the shortened form 

$$d\mathbf{X}_t = \mathbf{F}(t, \mathbf{X}_t)dt + B\sigma(t, \mathbf{X}_t)dW_t,$$

where $\mathbf{F} = (F_1, \dots, F_n)$ is an $\mathbb{R}^{nd}$-valued function. Moreover, for $\mathbf{x} = (x_1, \dots, x_n) \in (\mathbb{R}^d)^n$, we set $\mathbf{x}^{i,n} = 
(x_i, \dots, x_n) \in (\mathbb{R}^d)^{n-i+1}$. 

We introduce the following assumptions: 
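(R-$\eta$) The functions $(F_i)_{i \in [\![1,n]\!]}$ are uniformly Lipschitz continuous with constant $\kappa > 0$ (alternatively we can 
suppose for $i = 1$ that the drift of the non-degenerate component $F_1$ is measurable and bounded by $\kappa$). The 
diffusion matrix $(a(t,\cdot))_{t \ge 0}$ is uniformly $\eta$-Hölder continuous in space with constant $\kappa$, i.e. 

$$\forall t \ge 0, \quad \sup_{(\mathbf{x},\mathbf{y}) \in \mathbb{R}^{2nd},\ \mathbf{x} \ne \mathbf{y}} \frac{|a(t,\mathbf{x}) - a(t,\mathbf{y})|}{|\mathbf{x} - \mathbf{y}|^\eta} \le \kappa.$$

(UE) There exists $\Lambda \ge 1$ s.t. $\forall t \ge 0$, $\mathbf{x} \in \mathbb{R}^{nd}$, $\xi \in \mathbb{R}^d$, $\Lambda^{-1}|\xi|^2 \le \langle a(t,\mathbf{x})\xi, \xi\rangle \le \Lambda|\xi|^2$. 

(ND-$\eta$) For each integer $2 \le i \le n$, $(t, (x_i, \dots, x_n)) \in \mathbb{R}^+ \times \mathbb{R}^{(n-i+1)d}$, the function $x_{i-1} \in \mathbb{R}^d \mapsto F_i(t, \mathbf{x}^{i-1,n})$ 
is continuously differentiable, the derivative, $(t, \mathbf{x}^{i-1,n}) \in \mathbb{R}^+ \times \mathbb{R}^{(n-i+2)d} \mapsto D_{x_{i-1}}F_i(t, \mathbf{x}^{i-1,n})$, being $\eta$-Hölder 
continuous with constant $\kappa$. There exists a closed convex subset $\mathcal{E}_{i-1} \subset GL_d(\mathbb{R})$ (set of invertible $d \times d$ matrices) 
s.t., for all $t \ge 0$ and $(x_{i-1}, \dots, x_n) \in \mathbb{R}^{(n-i+2)d}$, the matrix $D_{x_{i-1}}F_i(t, \mathbf{x}^{i-1,n})$ belongs to $\mathcal{E}_{i-1}$. For example, 
$\mathcal{E}_i$, $1 \le i \le n-1$, may be a closed ball included in $GL_d(\mathbb{R})$, which is an open set. 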



Assumptions (UE), (ND-$\eta$) can be seen as a kind of (weak) Hörmander condition. They allow the 
non-degenerate noise of the first component to be transmitted to the other ones. Also, the particular structure of $\mathbf{F}(t,\cdot) = 
(F_1(t,\cdot), \dots, F_n(t,\cdot))$ yields that the $i$-th component has intrinsic time scale $(2i-1)/2$, $i \in [\![1,n]\!]$. We notice that 
the coefficients may be irregular in time. The last part of Assumption (ND-$\eta$) will be explained in Section 2.1. 
We say that assumption (A-$\eta$) is satisfied if (R-$\eta$), (UE), (ND-$\eta$) hold. 

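Under (A-$\eta$), we established in [DM10] Gaussian Aronson like estimates for the density of (1.5) over compact 
time interval $[0,T]$, for $\eta > 1/2$. Precisely, we proved that the unique weak solution of (1.5) admits a density 
that satisfies: for all $T > 0$, $\exists C := C(T, (\text{A-}\eta))$ s.t. $\forall (t,\mathbf{x},\mathbf{y}) \in (0,T] \times \mathbb{R}^{nd} \times \mathbb{R}^{nd}$: 

$$C^{-1}t^{-n^2d/2}\exp\big(-Ct|\mathbb{T}_t^{-1}(\boldsymbol{\theta}_t(\mathbf{x}) - \mathbf{y})|^2\big) \le p(t,\mathbf{x},\mathbf{y}) \le Ct^{-n^2d/2}\exp\big(-C^{-1}t|\mathbb{T}_t^{-1}(\boldsymbol{\theta}_t(\mathbf{x}) - \mathbf{y})|^2\big), \tag{1.6}$$

where $\mathbb{T}_t := \mathrm{diag}((t^i I_d)_{i \in [\![1,n]\!]})$ is a scale matrix and $\dot{\boldsymbol{\theta}}_t(\mathbf{x}) = \mathbf{F}(t, \boldsymbol{\theta}_t(\mathbf{x}))$, $\boldsymbol{\theta}_0(\mathbf{x}) = \mathbf{x}$. 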

To derive (1.6), we proceeded using a "formal" parametrix expansion considering a sequence of equations 
with smooth coefficients for which Hörmander's theorem guaranteed the existence of the density, see e.g. 
Hörmander [Hör67] or Norris [Nor86]. As in the previous paragraph, our estimates did not depend on 
the derivatives of the mollified coefficients but only on the $\eta$-Hölder continuity assumed in (A-$\eta$). However, to 
pass to the limit following the previously described procedure, some uniqueness in law is needed. Using the 
comparison principle for viscosity solutions of fully non-linear PDEs, see Ishii and Lions [IL90], we managed to 
obtain the bounds under (A-$\eta$), $\eta > 1/2$. However, the viscosity approach totally ignores the smoothing effects 
of the heat kernel and is not a "natural technique" to derive uniqueness in law. 

Introduce the generator of (1.5): 

$$\forall \varphi \in C_0^2(\mathbb{R}^+ \times \mathbb{R}^{nd},\mathbb{R}),\ \forall (t,\mathbf{x}) \in \mathbb{R}^+ \times \mathbb{R}^{nd},\quad L_t\varphi(t,\mathbf{x}) = \langle \mathbf{F}(t,\mathbf{x}), D_{\mathbf{x}}\varphi(t,\mathbf{x})\rangle + \frac{1}{2}\mathrm{Tr}\big(a(t,\mathbf{x})D_{x_1}^2\varphi(t,\mathbf{x})\big). \tag{1.7}$$

Adapting the technique of Bass and Perkins [BP09] we obtain the following results. 

Theorem 1.1. For $\eta \in (0,1]$, under (A-$\eta$) the martingale problem associated to $L$ in (1.7) is well-posed. In 
particular, weak uniqueness in law holds for the SDE (1.5). 

As a by-product we derive from [DM10] the following: 

Corollary 1.1. For $\eta$ in $(0,1]$, under (A-$\eta$), the unique weak solution of (1.5) admits for all $t > 0$ a density 
that satisfies the Aronson like bounds of equation (1.6). 
2. Choice of the reference Gaussian process for the parametrix 

In this section we describe the Gaussian processes that will be involved in the study of the martingale problem 
and that have been previously involved in the parametrix expansions of [DM10]. We first introduce in Section 
2.1 a class of degenerate linear stochastic differential equations that admit a density satisfying bounds similar to 
those of equation (1.6). We then specify how to properly linearize the dynamics of (1.5) so that the linearized 
equations belong to the class considered in Section 2.1. 

2.1. Some estimates on degenerate Gaussian processes with linear drift 

Introduce the stochastic differential equation: 

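$$d\mathbf{G}_t = \mathbf{L}_t\mathbf{G}_t\,dt + B\Sigma_t\,dW_t \tag{2.1}$$

where $\mathbf{L}_t\mathbf{x} = (0, a_t^1x_1, \dots, a_t^{n-1}x_{n-1})^* + \mathbf{U}_t\mathbf{x}$, and $\mathbf{U}_t \in \mathbb{R}^{nd} \otimes \mathbb{R}^{nd}$ is an "upper triangular" block matrix with 
zero entries on its first $d$ rows. We suppose that the coefficients satisfy the following assumption (A$^{\text{linear}}$): 

- The diffusion coefficient $A_t := \Sigma_t\Sigma_t^*$, $t \ge 0$, is uniformly elliptic and bounded, i.e. $\exists \Lambda \ge 1$, $\forall \xi \in \mathbb{R}^d$, $\Lambda^{-1}|\xi|^2 \le 
\langle A_t\xi, \xi\rangle \le \Lambda|\xi|^2$. 

- For each $i \in [\![1,n-1]\!]$, there exists a closed convex subset $\mathcal{E}_i \subset GL_d(\mathbb{R})$ s.t. for all $t \ge 0$, the matrix $a_t^i$ belongs 
to $\mathcal{E}_i$. 

Denoting by $(\mathbf{R}(s,t))_{0 \le s,t}$ the resolvent associated to $(\mathbf{L}_t)_{t \ge 0}$, i.e. $\partial_t\mathbf{R}(t,s) = \mathbf{L}_t\mathbf{R}(t,s)$, $\mathbf{R}(s,s) = I_{nd}$, we 
have for $0 \le s < t$, $\mathbf{x} \in \mathbb{R}^{nd}$, $\mathbf{G}_t^{s,\mathbf{x}} = \mathbf{R}(t,s)\mathbf{x} + \int_s^t \mathbf{R}(t,u)B\Sigma_u dW_u$ and $\mathrm{Cov}[\mathbf{G}_t^{s,\mathbf{x}}] = \int_s^t \mathbf{R}(t,u)BA_uB^*\mathbf{R}(t,u)^*du := 
\mathbf{K}(s,t)$. 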

From Propositions 3.1 and 3.4 in [DM10], the family $(\mathbf{K}(s,t))_{t \in (s,T]}$ of covariance matrices associated to the 
Gaussian process $(\mathbf{G}_t^{s,\mathbf{x}})_{t \in (s,T]}$ satisfies, under (A$^{\text{linear}}$), a "good scaling property" in the following sense: 

Definition 2.1 (Good scaling property). Fix $T > 0$. We say that a family $(\mathbf{K}(s,t))_{t \in [s,T]}$, $s \in [0,T)$, of 
$\mathbb{R}^{nd} \otimes \mathbb{R}^{nd}$ matrices satisfies a good scaling property with constant $C \ge 1$ (see also Definition 3.2 and Proposition 
3.4 of [DM10]) if for all $(t,s) \in (\mathbb{R}^+)^2$, $0 < t - s \le T$, $\forall \mathbf{y} \in \mathbb{R}^{nd}$, $C^{-1}(t-s)^{-1}|\mathbb{T}_{t-s}\mathbf{y}|^2 \le \langle \mathbf{K}(s,t)\mathbf{y}, \mathbf{y}\rangle \le 
C(t-s)^{-1}|\mathbb{T}_{t-s}\mathbf{y}|^2$. 

Precisely, the family $(\mathbf{K}(s,t))_{t \in (s,T]}$ satisfies under (A$^{\text{linear}}$) a good scaling property with constant $C := 
C(T, (\text{A}^{\text{linear}}))$. 

Remark 2.1. We point out that it is precisely the second assumption of (A$^{\text{linear}}$) concerning the existence of 
convex subsets $(\mathcal{E}_i)_{i \in [\![1,n-1]\!]}$ of $GL_d(\mathbb{R})$ that guarantees the good scaling property (see Propositions 3.1 and 3.4 
in [DM10] for details). 

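The density at time $t > s$ in $\mathbf{y} \in \mathbb{R}^{nd}$ of $\mathbf{G}_t^{s,\mathbf{x}}$ writes 

$$q(s,t,\mathbf{x},\mathbf{y}) = \frac{1}{(2\pi)^{nd/2}\det(\mathbf{K}(s,t))^{1/2}}\exp\Big(-\frac{1}{2}\big\langle \mathbf{K}(s,t)^{-1}(\mathbf{R}(t,s)\mathbf{x} - \mathbf{y}), \mathbf{R}(t,s)\mathbf{x} - \mathbf{y}\big\rangle\Big). \tag{2.2}$$

Since under (A$^{\text{linear}}$), $(\mathbf{K}(s,t))_{t \in (s,T]}$ satisfies a good scaling property in the sense of Definition 2.1, we then 
derive from (2.2): 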

Proposition 2.1. Under (A$^{\text{linear}}$), for all $T > 0$ there exists a constant $C := C(T, (\text{A}^{\text{linear}})) \ge 1$ s.t.: 

$$\begin{aligned}
C^{-1}(t-s)^{-n^2d/2}\exp\big(-C(t-s)|\mathbb{T}_{t-s}^{-1}(\mathbf{R}(t,s)\mathbf{x} - \mathbf{y})|^2\big) &\le q(s,t,\mathbf{x},\mathbf{y}) \\
&\le C(t-s)^{-n^2d/2}\exp\big(-C^{-1}(t-s)|\mathbb{T}_{t-s}^{-1}(\mathbf{R}(t,s)\mathbf{x} - \mathbf{y})|^2\big).
\end{aligned} \tag{2.3}$$

This means that the off-diagonal bound of Gaussian processes with dynamics (2.1) and fulfilling (A$^{\text{linear}}$) is 
homogeneous to the square of the difference between the final point $\mathbf{y}$ and $\mathbf{R}(t,s)\mathbf{x}$ (which corresponds to the 
transport of the initial condition by the deterministic system deriving from (2.1), that is $\dot{\boldsymbol{\theta}}_t = \mathbf{L}_t\boldsymbol{\theta}_t$) rescaled 
by the intrinsic time-scale of each component. We here recall that the component $i \in [\![1,n]\!]$ has characteristic 
time scale $t^{(2i-1)/2}$. 
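
As an illustration of the good scaling property, here is a minimal sketch (not taken from the paper) for the simplest instance of (2.1): $n = 2$, $d = 1$, $a_t^1 = 1$, $\Sigma_t = 1$ and $\mathbf{U}_t = 0$, i.e. a Brownian motion and its time integral. In that case $\mathbf{K}(0,t)$ has entries $t$, $t^2/2$, $t^3/3$, and the rescaled matrix $t\,\mathbb{T}_t^{-1}\mathbf{K}(0,t)\mathbb{T}_t^{-1}$ with $\mathbb{T}_t = \mathrm{diag}(t, t^2)$ does not depend on $t$. The function names are hypothetical.

```python
import math


def kolmogorov_cov(t):
    """Covariance of (W_t, int_0^t W_u du) for a scalar Brownian motion W."""
    return [[t, t ** 2 / 2.0], [t ** 2 / 2.0, t ** 3 / 3.0]]


def rescaled_cov(t):
    """t * T_t^{-1} K(0,t) T_t^{-1} with the scale matrix T_t = diag(t, t^2)."""
    K = kolmogorov_cov(t)
    scale = [t, t ** 2]
    return [[t * K[i][j] / (scale[i] * scale[j]) for j in range(2)]
            for i in range(2)]


def eigenvalues_sym2(M):
    """Eigenvalues (min, max) of a symmetric 2x2 matrix from trace and determinant."""
    tr = M[0][0] + M[1][1]
    det = M[0][0] * M[1][1] - M[0][1] * M[1][0]
    disc = math.sqrt(max(tr * tr / 4.0 - det, 0.0))
    return tr / 2.0 - disc, tr / 2.0 + disc


def good_scaling_constant(t):
    """Smallest C >= 1 with C^{-1} t^{-1}|T_t y|^2 <= <K(0,t) y, y> <= C t^{-1}|T_t y|^2."""
    lo, hi = eigenvalues_sym2(rescaled_cov(t))
    return max(hi, 1.0 / lo, 1.0)


if __name__ == "__main__":
    for t in (1e-3, 1e-1, 1.0):
        print(t, rescaled_cov(t), good_scaling_constant(t))
```

In this toy case the constant is the same for every $t > 0$; in the general setting of (A$^{\text{linear}}$) it depends on $T$ and on the assumptions, as stated above.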



2.2. Linearization of the initial dynamics and associated estimates 



The crucial feature of the parametrix method described in the introduction was to choose a "good" process to 
approximate the density of the diffusion. In the uniformly elliptic case, with bounded coefficients, one could take, 
as a first approximation, the Gaussian process with coefficients frozen in space at the fixed final spatial point 
where we wanted to estimate the density. The choice is natural since it makes the kernel $H$ (defined in (1.2)) 
"compatible" with the bounds of the frozen density. It is precisely the off-diagonal term in $\exp(-c|x-y|^2/(t-s))$ 
that allows to equilibrate the singularity in $|x-y|^\eta/(t-s)$ coming from the second order spatial derivatives. 
In their work [BP09], Bass and Perkins exactly exploited the specific behavior of the singular kernel $H$, which 
has an integrable singularity in time at $0$ (see their Proposition 2.3), to derive uniqueness of the martingale 
problem in the non-degenerate time-homogeneous framework. This approach provides a natural link between 
parametrix expansions and the study of martingale problems for uniformly Hölder continuous coefficients. 
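
The balance between the singular factor and the off-diagonal Gaussian term can be checked on a scalar toy computation. The sketch below is an illustration under simplifying assumptions, not the paper's estimate: writing $r = |x-y|$ and $\tau = t-s$, the map $r \mapsto r^\eta\tau^{-1}\exp(-cr^2/\tau)$ is maximal at $r^2 = \eta\tau/(2c)$, where it equals $(\eta/(2ce))^{\eta/2}\tau^{-1+\eta/2}$, an integrable singularity in $\tau$ as soon as $\eta > 0$. Function names and the grid resolution are hypothetical.

```python
import math


def kernel_factor(r, tau, eta, c):
    """Singular factor r^eta / tau times the off-diagonal term exp(-c r^2 / tau)."""
    return r ** eta / tau * math.exp(-c * r * r / tau)


def sup_closed_form(tau, eta, c):
    """Closed-form supremum over r >= 0, attained at r^2 = eta * tau / (2 c)."""
    return (eta / (2.0 * c * math.e)) ** (eta / 2.0) * tau ** (-1.0 + eta / 2.0)


def sup_on_grid(tau, eta, c, n=20000):
    """Brute-force maximum over a uniform grid of r in [0, 10 sqrt(tau)]."""
    rmax = 10.0 * math.sqrt(tau)
    return max(kernel_factor(k * rmax / n, tau, eta, c) for k in range(n + 1))


if __name__ == "__main__":
    eta, c = 0.5, 1.0  # hypothetical sample values
    for tau in (1.0, 1e-2, 1e-4):
        print(tau, sup_on_grid(tau, eta, c), sup_closed_form(tau, eta, c))
```

Without the exponential term the factor $r^\eta/\tau$ would only yield the non-integrable rate $\tau^{-1}$; the Gaussian tail converts the Hölder gain in space into the gain $\tau^{\eta/2}$ in time.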

Parametrix expansions, to derive density estimates on systems of the form (1.5), have been discussed in 
[DM10]. We thus have in the current degenerate framework of assumption (A-$\eta$) some natural candidate 
defined below. The key idea is to consider a "degenerate" Gaussian process whose density anyhow has a specific 
"off-diagonal" behavior similar to the one exhibited in equation (2.3) and to choose the freezing process in order 
that the singularity deriving from $H$ is still compatible with the "off-diagonal" bound in the sense that it will 
be sufficient to remove the time-singularity. 

We follow the same line of reasoning in our current framework. 

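For fixed parameters $T > 0$, $\mathbf{y} \in \mathbb{R}^{nd}$, introduce the linear equation: 

$$d\tilde{\mathbf{X}}_t^{T,\mathbf{y}} = \big[\mathbf{F}(t, \boldsymbol{\theta}_{t,T}(\mathbf{y})) + D\mathbf{F}(t, \boldsymbol{\theta}_{t,T}(\mathbf{y}))(\tilde{\mathbf{X}}_t^{T,\mathbf{y}} - \boldsymbol{\theta}_{t,T}(\mathbf{y}))\big]dt + B\sigma(t, \boldsymbol{\theta}_{t,T}(\mathbf{y}))dW_t, \quad 0 \le t \le T, \tag{2.4}$$

where $(\boldsymbol{\theta}_{t,T}(\mathbf{y}))_{t \ge 0}$ solves the ODE $[d/dt]\boldsymbol{\theta}_{t,T}(\mathbf{y}) = \mathbf{F}(t, \boldsymbol{\theta}_{t,T}(\mathbf{y}))$, $t \ge 0$, with the boundary condition $\boldsymbol{\theta}_{T,T}(\mathbf{y}) = 
\mathbf{y}$, and $\forall (t,\mathbf{x}) \in [0,T] \times \mathbb{R}^{nd}$, 

$$D\mathbf{F}(t,\mathbf{x}) = \begin{pmatrix}
0 & \cdots & \cdots & \cdots & 0 \\
D_{x_1}F_2(t,\mathbf{x}) & 0 & \cdots & \cdots & 0 \\
0 & D_{x_2}F_3(t,\mathbf{x}) & 0 & & \vdots \\
\vdots & & \ddots & \ddots & \vdots \\
0 & \cdots & 0 & D_{x_{n-1}}F_n(t,\mathbf{x}) & 0
\end{pmatrix}$$

is the sub-diagonal of the Jacobian matrix $D_{\mathbf{x}}\mathbf{F}$. Write $\tilde p^{T,\mathbf{y}}(t,T,\mathbf{x},\cdot)$ for the density of $\tilde{\mathbf{X}}_T^{T,\mathbf{y}}$ starting from $\mathbf{x}$ at time 
$t$. 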



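The deterministic ODE associated with $\tilde{\mathbf{X}}^{T,\mathbf{y}}$ has the form 

$$\frac{d}{dt}\tilde{\boldsymbol{\phi}}_t = \mathbf{F}(t, \boldsymbol{\theta}_{t,T}(\mathbf{y})) + D\mathbf{F}(t, \boldsymbol{\theta}_{t,T}(\mathbf{y}))[\tilde{\boldsymbol{\phi}}_t - \boldsymbol{\theta}_{t,T}(\mathbf{y})], \quad t \ge 0. \tag{2.5}$$

We denote by $(\tilde{\boldsymbol{\theta}}_{t,s}^{T,\mathbf{y}})_{s,t \ge 0}$ the associated flow, i.e. $\tilde{\boldsymbol{\theta}}_{t,s}^{T,\mathbf{y}}(\mathbf{x})$ is the value of $\tilde{\boldsymbol{\phi}}_t$ when $\tilde{\boldsymbol{\phi}}_s = \mathbf{x}$. It is affine: 

$$\tilde{\boldsymbol{\theta}}_{t,s}^{T,\mathbf{y}}(\mathbf{x}) = \tilde{\mathbf{R}}^{T,\mathbf{y}}(t,s)\mathbf{x} + \int_s^t \tilde{\mathbf{R}}^{T,\mathbf{y}}(t,u)\big(\mathbf{F}(u, \boldsymbol{\theta}_{u,T}(\mathbf{y})) - D\mathbf{F}(u, \boldsymbol{\theta}_{u,T}(\mathbf{y}))\boldsymbol{\theta}_{u,T}(\mathbf{y})\big)du. \tag{2.6}$$

Above, $(\tilde{\mathbf{R}}^{T,\mathbf{y}}(t,s))_{s,t \ge 0}$ stands for the resolvent associated with the matrices $(D\mathbf{F}(t, \boldsymbol{\theta}_{t,T}(\mathbf{y})))_{t \ge 0}$. 
We now claim 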

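Lemma 2.1. Let $T_0 > 0$ be fixed. There exists a constant $C \ge 1$, depending on (A-$\eta$) and $T_0$ such that, for 
any $t \in [0,T)$, $T \le T_0$ and $\mathbf{x}, \mathbf{y} \in \mathbb{R}^{nd}$, 

$$C^{-1}\big|\mathbb{T}_{T-t}^{-1}[\mathbf{x} - \boldsymbol{\theta}_{t,T}(\mathbf{y})]\big| \le \big|\mathbb{T}_{T-t}^{-1}[\tilde{\boldsymbol{\theta}}_{T,t}^{T,\mathbf{y}}(\mathbf{x}) - \mathbf{y}]\big| \le C\big|\mathbb{T}_{T-t}^{-1}[\mathbf{x} - \boldsymbol{\theta}_{t,T}(\mathbf{y})]\big|.$$

This means that we can compare the rescaled "forward" transport of the initial condition $\mathbf{x}$ from $t$ to $T$ by the 
linear flow and the rescaled "backward" transport from $T$ to $t$ of the final point $\mathbf{y}$ by the original deterministic 
differential dynamics. We refer to Lemma 5.3 of [DM10] for a proof. 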

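Furthermore, under (A-$\eta$) we have that $D\mathbf{F}(t, \boldsymbol{\theta}_{t,T}(\mathbf{y}))$ satisfies (A$^{\text{linear}}$). We thus derive from Lemma 2.1 
and a direct extension of Proposition 2.1 (the mean of $\tilde{\mathbf{X}}_T^{T,\mathbf{y}}$ starting from $\mathbf{x}$ at time $t$ being $\tilde{\boldsymbol{\theta}}_{T,t}^{T,\mathbf{y}}(\mathbf{x})$) the 
following result. 

Lemma 2.2. Let $T_0 > 0$ be fixed. There exists a constant $C > 0$, depending on (A-$\eta$) and $T_0$ such that, for 
all $0 \le t < T \le T_0$ and $\mathbf{x}, \mathbf{y} \in \mathbb{R}^{nd}$, 

$$\tilde p^{T,\mathbf{y}}(t,T,\mathbf{x},\mathbf{y}) \le C g_{C,T-t}(\mathbf{x} - \boldsymbol{\theta}_{t,T}(\mathbf{y})),$$

where for all $a > 0$, $t > 0$, $g_{a,t}(\mathbf{y}) = t^{-n^2d/2}\exp(-a^{-1}t|\mathbb{T}_t^{-1}\mathbf{y}|^2)$, $\mathbf{y} \in \mathbb{R}^{nd}$. 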

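For all $0 \le s < t \le T$, $\mathbf{z}, \mathbf{y} \in \mathbb{R}^{nd}$, define now the kernel $H$ as: 

$$\begin{aligned}
H(s,t,\mathbf{z},\mathbf{y}) ={}& \Big\{\langle \mathbf{F}(s,\mathbf{z}), D_{\mathbf{z}}\tilde p^{t,\mathbf{y}}(s,t,\mathbf{z},\mathbf{y})\rangle + \frac{1}{2}\mathrm{Tr}\big(a(s,\mathbf{z})D_{z_1}^2\tilde p^{t,\mathbf{y}}(s,t,\mathbf{z},\mathbf{y})\big)\Big\} \\
&- \Big\{\langle \mathbf{F}(s, \boldsymbol{\theta}_{s,t}(\mathbf{y})) + D\mathbf{F}(s, \boldsymbol{\theta}_{s,t}(\mathbf{y}))(\mathbf{z} - \boldsymbol{\theta}_{s,t}(\mathbf{y})), D_{\mathbf{z}}\tilde p^{t,\mathbf{y}}(s,t,\mathbf{z},\mathbf{y})\rangle \\
&\qquad + \frac{1}{2}\mathrm{Tr}\big(a(s, \boldsymbol{\theta}_{s,t}(\mathbf{y}))D_{z_1}^2\tilde p^{t,\mathbf{y}}(s,t,\mathbf{z},\mathbf{y})\big)\Big\} := (L_s - \tilde L_s^{t,\mathbf{y}})_{\mathbf{z}}\,\tilde p^{t,\mathbf{y}}(s,t,\mathbf{z},\mathbf{y}),
\end{aligned} \tag{2.7}$$

where $L$ is the generator of the initial diffusion (1.5) defined in (1.7). $\tilde L_s^{t,\mathbf{y}}$ and $\tilde p^{t,\mathbf{y}}(s,t,\mathbf{z},\cdot)$ respectively stand 
for the generator at time $s$ and the density at time $t$ of $\tilde{\mathbf{X}}^{t,\mathbf{y}}$, $\tilde{\mathbf{X}}_s^{t,\mathbf{y}} = \mathbf{z}$, with coefficients "frozen" w.r.t. $t, \mathbf{y}$. 
The subscript $\mathbf{z}$ is to emphasize that $\mathbf{z}$ is the differentiation parameter in the operator. 
We have the following control on the kernel (see Lemma 5.5 of [DM10] for a proof). 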

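Lemma 2.3. Let $T_0 > 0$ be fixed. There exists a constant $C > 0$, depending on (A-$\eta$) and $T_0$ such that, for 
all $t \in [0,T)$, $T \le T_0$ and $\mathbf{z}, \mathbf{y} \in \mathbb{R}^{nd}$, 

$$|H(t,T,\mathbf{z},\mathbf{y})| \le C(T-t)^{-1+\eta/2}\,g_{C,T-t}(\mathbf{z} - \boldsymbol{\theta}_{t,T}(\mathbf{y})).$$

We conclude this section with a technical Lemma whose proof is postponed to Appendix A. 

Lemma 2.4. Let $h$ be a $C_0([0,T] \times \mathbb{R}^{nd},\mathbb{R})$ function. Define for all $(s,\mathbf{x}) \in [0,T] \times \mathbb{R}^{nd}$, 

$$\forall \varepsilon > 0, \quad \Xi_\varepsilon(s,\mathbf{x}) := \int_{\mathbb{R}^{nd}} d\mathbf{y}\, h(s,\mathbf{y})\,\tilde p^{s+\varepsilon,\mathbf{y}}(s, s+\varepsilon, \mathbf{x}, \mathbf{y}).$$

Then $\Xi_\varepsilon(s,\mathbf{x})$ converges boundedly and pointwise to $h(s,\mathbf{x})$ when $\varepsilon \to 0$. 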
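
The content of Lemma 2.4 can be visualised in a much simpler situation. The following sketch is only an illustration under strong simplifying assumptions (scalar, non-degenerate, driftless and time-homogeneous case, so that the flow is the identity): the Gaussian density is frozen at the integration variable $y$, hence it is not a probability density in $y$, and yet the integral against $h$ still converges to $h(x)$. The coefficient `a`, the test function `h`, and the quadrature parameters are hypothetical choices.

```python
import math


def a(y):
    """Hypothetical Lipschitz, uniformly elliptic coefficient with values in [0.5, 1.5]."""
    return 1.0 + 0.5 * math.sin(y)


def h(y):
    """Hypothetical continuous, compactly supported test function."""
    return max(0.0, 1.0 - y * y) ** 2


def frozen_density(eps, x, y):
    """Gaussian density at y of x + sqrt(a(y)) W_eps: the variance is frozen at the END point y."""
    var = a(y) * eps
    return math.exp(-(x - y) ** 2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def xi(eps, x, n=4000, width=12.0):
    """Midpoint-rule approximation of the integral of h(y) * frozen_density(eps, x, y) in y."""
    half = width * math.sqrt(1.5 * eps)  # 1.5 is an upper bound for a
    step = 2.0 * half / n
    total = 0.0
    for k in range(n):
        y = x - half + (k + 0.5) * step
        total += h(y) * frozen_density(eps, x, y)
    return total * step


if __name__ == "__main__":
    x = 0.3
    for eps in (1e-1, 1e-2, 1e-3, 1e-4):
        print(eps, xi(eps, x), h(x))
```

In the degenerate setting of the paper the same convergence requires the comparison of flows and covariances carried out in Appendix A; the sketch only shows the mechanism in the elliptic case.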
3. Proof of Theorem 1.1 

Suppose we are given two solutions $\mathbb{P}^1, \mathbb{P}^2$ of the martingale problem associated to $(L_t)_{t \in [s,T]}$ starting in $\mathbf{x}$ 
at time $s$. W.l.o.g. we can suppose here that $T \le 1$. Define for a bounded Borel function $f : [0,T] \times \mathbb{R}^{nd} \to \mathbb{R}$: 

$$S^i f := \mathbb{E}^i\Big[\int_s^T dt\, f(t, \mathbf{X}_t)\Big], \quad i \in [\![1,2]\!],$$

where $(\mathbf{X}_t)_{t \in [s,T]}$ stands here for the canonical process associated to $(\mathbb{P}^i)_{i \in [\![1,2]\!]}$. Let us specify (as indicated 
in [BP09]) that $S^i f$ is only a linear functional and not a function since $\mathbb{P}^i$ does not need to come from a Markov 
process. Let us now introduce 

$$S^\Delta f := S^1 f - S^2 f, \qquad \Theta := \sup_{\|f\|_\infty \le 1} |S^\Delta f|.$$

Clearly $\Theta \le T - s$. 

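If $f \in C_0^{1,2}([0,T) \times \mathbb{R}^{nd},\mathbb{R})$ then by definition of the martingale problem we have: 

$$f(s,\mathbf{x}) + \mathbb{E}^i\Big[\int_s^T dt\,(\partial_t + L_t)f(t, \mathbf{X}_t)\Big] = 0, \quad i \in [\![1,2]\!]. \tag{3.1}$$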



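For a fixed point $\mathbf{y} \in \mathbb{R}^{nd}$ and $\varepsilon \ge 0$, introduce $\forall f \in C_0^{1,2}([0,T) \times \mathbb{R}^{nd},\mathbb{R})$ the Green function 

$$\forall (s,\mathbf{x}) \in [0,T) \times \mathbb{R}^{nd}, \quad G^{\varepsilon,\mathbf{y}}f(s,\mathbf{x}) := \int_s^T dt \int_{\mathbb{R}^{nd}} d\mathbf{z}\, \tilde p^{t+\varepsilon,\mathbf{y}}(s,t,\mathbf{x},\mathbf{z})f(t,\mathbf{z}). \tag{3.2}$$

We insist that in the above equation $\tilde p^{t+\varepsilon,\mathbf{y}}(s,t,\mathbf{x},\mathbf{z})$ stands for the density at time $t$ and point $\mathbf{z}$ of the process 
$\tilde{\mathbf{X}}^{t+\varepsilon,\mathbf{y}}$ defined in (2.4) starting from $\mathbf{x}$ at time $s$ with coefficients depending on the backward transport of the 
freezing point $\mathbf{y}$ by $(\boldsymbol{\theta}_{u,t+\varepsilon})_{u \in [s,t]}$. In particular, the parameter $\varepsilon$ can be equal to $0$ in the previous definition. 
One easily checks that 

$$(\partial_s + \tilde L_s^{t+\varepsilon,\mathbf{y}})\tilde p^{t+\varepsilon,\mathbf{y}}(s,t,\mathbf{x},\mathbf{z}) = 0,\ \forall (s,\mathbf{x},\mathbf{z}) \in [0,t) \times (\mathbb{R}^{nd})^2, \qquad \tilde p^{t+\varepsilon,\mathbf{y}}(s,t,\mathbf{x},\cdot) \underset{s \uparrow t}{\longrightarrow} \delta_{\mathbf{x}}(\cdot). \tag{3.3}$$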
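Introducing for all $f \in C_0^{1,2}([0,T) \times \mathbb{R}^{nd},\mathbb{R})$, $(s,\mathbf{x}) \in [0,T) \times \mathbb{R}^{nd}$, 

$$M_s^{\varepsilon,\mathbf{y}}f(s,\mathbf{x}) := \int_s^T dt \int_{\mathbb{R}^{nd}} d\mathbf{z}\, \tilde L_s^{t+\varepsilon,\mathbf{y}}\tilde p^{t+\varepsilon,\mathbf{y}}(s,t,\mathbf{x},\mathbf{z})f(t,\mathbf{z}),$$

we get from equations (3.2), (3.3) 

$$(\partial_s G^{\varepsilon,\mathbf{y}}f + M_s^{\varepsilon,\mathbf{y}}f)(s,\mathbf{x}) = -f(s,\mathbf{x}), \quad \forall (s,\mathbf{x}) \in [0,T) \times \mathbb{R}^{nd}. \tag{3.4}$$

Define now for a smooth function $h \in C_0^{1,2}([0,T) \times \mathbb{R}^{nd},\mathbb{R})$ and for all $(s,\mathbf{x}) \in [0,T) \times \mathbb{R}^{nd}$: 

$$\Phi^{\varepsilon,\mathbf{y}}(s,\mathbf{x}) := \tilde p^{s+\varepsilon,\mathbf{y}}(s, s+\varepsilon, \mathbf{x}, \mathbf{y})h(s,\mathbf{y}).$$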
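Observe that (3.2) yields: 

$$\begin{aligned}
\Psi_\varepsilon(s,\mathbf{x}) &:= \int_{\mathbb{R}^{nd}} d\mathbf{y} \int_s^T dt \int_{\mathbb{R}^{nd}} d\mathbf{z}\, \tilde p^{t+\varepsilon,\mathbf{y}}(s,t,\mathbf{x},\mathbf{z})\,\Phi^{\varepsilon,\mathbf{y}}(t,\mathbf{z}) \\
&= \int_{\mathbb{R}^{nd}} d\mathbf{y} \int_s^T dt \int_{\mathbb{R}^{nd}} d\mathbf{z}\, \tilde p^{t+\varepsilon,\mathbf{y}}(s,t,\mathbf{x},\mathbf{z})\,\tilde p^{t+\varepsilon,\mathbf{y}}(t, t+\varepsilon, \mathbf{z}, \mathbf{y})\,h(t,\mathbf{y}) \\
&= \int_{\mathbb{R}^{nd}} d\mathbf{y} \int_s^T dt\, \tilde p^{t+\varepsilon,\mathbf{y}}(s, t+\varepsilon, \mathbf{x}, \mathbf{y})\,h(t,\mathbf{y}),
\end{aligned} \tag{3.5}$$

exploiting the semigroup property of the frozen density $\tilde p^{t+\varepsilon,\mathbf{y}}$ for the last equality. Write now for all $(s,\mathbf{x}) \in 
[0,T) \times \mathbb{R}^{nd}$, 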

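$$\begin{aligned}
(\partial_s + L_s)\Psi_\varepsilon(s,\mathbf{x}) &= \int_{\mathbb{R}^{nd}} d\mathbf{y}\,(\partial_s + L_s)(G^{\varepsilon,\mathbf{y}}\Phi^{\varepsilon,\mathbf{y}})(s,\mathbf{x}) \\
&= \int_{\mathbb{R}^{nd}} d\mathbf{y}\,(\partial_s G^{\varepsilon,\mathbf{y}}\Phi^{\varepsilon,\mathbf{y}} + M_s^{\varepsilon,\mathbf{y}}\Phi^{\varepsilon,\mathbf{y}})(s,\mathbf{x}) + \int_{\mathbb{R}^{nd}} d\mathbf{y}\,(L_s G^{\varepsilon,\mathbf{y}}\Phi^{\varepsilon,\mathbf{y}} - M_s^{\varepsilon,\mathbf{y}}\Phi^{\varepsilon,\mathbf{y}})(s,\mathbf{x}) \\
&= -\int_{\mathbb{R}^{nd}} d\mathbf{y}\,\Phi^{\varepsilon,\mathbf{y}}(s,\mathbf{x}) + \int_{\mathbb{R}^{nd}} d\mathbf{y}\,(L_s G^{\varepsilon,\mathbf{y}}\Phi^{\varepsilon,\mathbf{y}} - M_s^{\varepsilon,\mathbf{y}}\Phi^{\varepsilon,\mathbf{y}})(s,\mathbf{x}) := I_1^\varepsilon + I_2^\varepsilon,
\end{aligned}$$

exploiting (3.4) for the last but one identity. Now Lemma 2.4 gives $I_1^\varepsilon \underset{\varepsilon \to 0}{\longrightarrow} -h(s,\mathbf{x})$. On the other hand, using 
the notations of Section 2.2, we derive from (3.5) that the term $I_2^\varepsilon$ writes: 

$$\begin{aligned}
I_2^\varepsilon &= \int_s^T dt \int_{\mathbb{R}^{nd}} d\mathbf{y}\,(L_s - \tilde L_s^{t+\varepsilon,\mathbf{y}})\tilde p^{t+\varepsilon,\mathbf{y}}(s, t+\varepsilon, \mathbf{x}, \mathbf{y})\,h(t,\mathbf{y}) \\
&= \int_s^T dt \int_{\mathbb{R}^{nd}} d\mathbf{y}\,H(s, t+\varepsilon, \mathbf{x}, \mathbf{y})\,h(t,\mathbf{y}).
\end{aligned}$$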

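Thus, from Lemma 2.3, 

$$\begin{aligned}
|I_2^\varepsilon| &\le C|h|_\infty \int_s^T dt\,(t+\varepsilon-s)^{-1+\eta/2} \int_{\mathbb{R}^{nd}} d\mathbf{y}\, g_{C,t+\varepsilon-s}(\mathbf{x} - \boldsymbol{\theta}_{s,t+\varepsilon}(\mathbf{y})) \\
&\le C|h|_\infty \int_s^T dt\,(t+\varepsilon-s)^{-1+\eta/2} \int_{\mathbb{R}^{nd}} d\mathbf{y}\, g_{C,t+\varepsilon-s}(\boldsymbol{\theta}_{t+\varepsilon,s}(\mathbf{x}) - \mathbf{y}) \le C\{(T-s) \vee \varepsilon\}^{\eta/2}|h|_\infty,
\end{aligned} \tag{3.6}$$

using the bi-Lipschitz property of the flow for the last but one inequality and up to a modification of $C$ 
in the last one. Anyhow, the constant $C$ only depends on known parameters in (A-$\eta$). Thus for $T$ and $\varepsilon$ 
sufficiently small we have from (3.6) 

$$|I_2^\varepsilon| \le \frac{1}{2}|h|_\infty. \tag{3.7}$$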
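Now, equation (3.1) and the above definition of $S^\Delta$ yield: 

$$S^\Delta\big((\partial_s + L_s)\Psi_\varepsilon\big) = 0 \implies |S^\Delta I_1^\varepsilon| = |S^\Delta I_2^\varepsilon|.$$

From the bounded convergence part of Lemma 2.4 and (3.7), we have: 

$$|S^\Delta h| = \lim_{\varepsilon \to 0}|S^\Delta I_1^\varepsilon| = \lim_{\varepsilon \to 0}|S^\Delta I_2^\varepsilon| \le \Theta \limsup_{\varepsilon \to 0}|I_2^\varepsilon|_\infty \le \Theta\frac{|h|_\infty}{2}.$$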
By a monotone class argument, the previous inequality remains valid for bounded measurable functions $h$ 
compactly supported in $[0,T) \times \mathbb{R}^{nd}$. Taking the supremum over $\|h\|_\infty \le 1$, we obtain $\Theta \le \frac{1}{2}\Theta$, which gives 
$\Theta = 0$ since $\Theta < +\infty$. Hence, $\mathbb{E}^1\big[\int_s^T dt\,h(t,\mathbf{X}_t)\big] = \mathbb{E}^2\big[\int_s^T dt\,h(t,\mathbf{X}_t)\big]$, which proves the result on the interval 
$[0,T]$. Regular conditional probabilities then allow to extend the result on $\mathbb{R}^+$, see Chapter 6.2 of [SV79] for 
details. 



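Appendix A. Proof of Lemma 2.4 

Let us denote by $\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s+\varepsilon,\mathbf{y}}$ the covariance matrix associated to equation (2.4) for the process $\tilde{\mathbf{X}}^{s+\varepsilon,\mathbf{y}}$ (starting 
from $\mathbf{x}$ at $s$) at time $s+\varepsilon$ and by $\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}}$ the covariance matrix associated to the linear diffusion with dynamics: 

$$d\tilde{\mathbf{X}}_t = D\mathbf{F}(t, \boldsymbol{\theta}_{t,s}(\mathbf{x}))\tilde{\mathbf{X}}_t\,dt + B\sigma(t, \boldsymbol{\theta}_{t,s}(\mathbf{x}))dW_t, \quad t \ge s, \tag{A.1}$$

that is $\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}} = \int_s^{s+\varepsilon} du\, \tilde{\mathbf{R}}^{s,\mathbf{x}}(s+\varepsilon,u)Ba(u, \boldsymbol{\theta}_{u,s}(\mathbf{x}))B^*\tilde{\mathbf{R}}^{s,\mathbf{x}}(s+\varepsilon,u)^*$ where $\tilde{\mathbf{R}}^{s,\mathbf{x}}$ stands for the resolvent 
associated to the linear part of (A.1). 

Under (A-$\eta$), the matrices $\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s+\varepsilon,\mathbf{y}}$, $\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}}$ admit a good scaling property in the sense of the previous 
Definition 2.1, i.e. for $\tilde{\mathbf{K}} \in \{\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s+\varepsilon,\mathbf{y}}, \tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}}\}$, 

$$\exists C := C((\text{A-}\eta)) \ge 1,\ \forall \boldsymbol{\xi} \in \mathbb{R}^{nd}, \quad C^{-1}\varepsilon^{-1}|\mathbb{T}_\varepsilon\boldsymbol{\xi}|^2 \le \langle \tilde{\mathbf{K}}\boldsymbol{\xi}, \boldsymbol{\xi}\rangle \le C\varepsilon^{-1}|\mathbb{T}_\varepsilon\boldsymbol{\xi}|^2. \tag{A.2}$$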



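We introduce the following decomposition: 

$$\begin{aligned}
\Xi_\varepsilon(s,\mathbf{x}) :={}& \int_{\mathbb{R}^{nd}} d\mathbf{y}\,\frac{h(s,\mathbf{y})}{(2\pi)^{nd/2}}\exp\Big(-\frac{1}{2}\big\langle(\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s+\varepsilon,\mathbf{y}})^{-1}(\tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y}), \tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y}\big\rangle\Big)\Big[\frac{1}{\det(\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s+\varepsilon,\mathbf{y}})^{1/2}} - \frac{1}{\det(\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{1/2}}\Big] \\
&+ \int_{\mathbb{R}^{nd}} d\mathbf{y}\,\frac{h(s,\mathbf{y})}{(2\pi)^{nd/2}\det(\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{1/2}}\Big[\exp\Big(-\frac{1}{2}\big\langle(\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s+\varepsilon,\mathbf{y}})^{-1}(\tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y}), \tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y}\big\rangle\Big) \\
&\qquad\qquad - \exp\Big(-\frac{1}{2}\big\langle(\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{-1}(\tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y}), \tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y}\big\rangle\Big)\Big] \\
&+ \int_{\mathbb{R}^{nd}} d\mathbf{y}\,\frac{h(s,\mathbf{y})}{(2\pi)^{nd/2}\det(\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{1/2}}\Big[\exp\Big(-\frac{1}{2}\big\langle(\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{-1}(\tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y}), \tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y}\big\rangle\Big) \\
&\qquad\qquad - \exp\Big(-\frac{1}{2}\big\langle(\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{-1}(\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y}), \boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y}\big\rangle\Big)\Big] \\
&+ \int_{\mathbb{R}^{nd}} d\mathbf{y}\,\frac{h(s,\mathbf{y})}{(2\pi)^{nd/2}\det(\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{1/2}}\exp\Big(-\frac{1}{2}\big\langle(\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{-1}(\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y}), \boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y}\big\rangle\Big) \\
:={}& \sum_{i=1}^{4}\Xi_\varepsilon^i(s,\mathbf{x}).
\end{aligned} \tag{A.3}$$

Let us now control the $(\Xi_\varepsilon^i(s,\mathbf{x}))_{i \in [\![1,4]\!]}$. Set $\mathbf{y}' := (\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{-1/2}(\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y})$ where $(\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{-1/2}$ denotes the 
upper triangular matrix obtained through the Cholesky factorization, i.e. $(\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{-1} = ((\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{-1/2})^*(\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{-1/2}$. 
We get from the bounded convergence theorem: 

$$\Xi_\varepsilon^4(s,\mathbf{x}) = \int_{\mathbb{R}^{nd}} \frac{d\mathbf{y}'}{(2\pi)^{nd/2}}\,h\Big(s, \boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \big((\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{-1/2}\big)^{-1}\mathbf{y}'\Big)\exp\Big(-\frac{|\mathbf{y}'|^2}{2}\Big) \underset{\varepsilon \to 0}{\longrightarrow} h(s,\mathbf{x}). \tag{A.4}$$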
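For $\Xi_\varepsilon^3(s,\mathbf{x})$, we first observe, from Lemma 2.1, the good scaling property (A.2) and the bi-Lipschitz property 
of the flow $\boldsymbol{\theta}$, that there exists $C_1 := C_1((\text{A-}\eta)) \ge 1$ s.t. 

$$\begin{aligned}
C_1^{-1}\varepsilon^{1/2}|\mathbb{T}_\varepsilon^{-1}(\tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y})| &\le |(\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{-1/2}(\tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y})| \le C_1\varepsilon^{1/2}|\mathbb{T}_\varepsilon^{-1}(\tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y})|, \\
C_1^{-1}\varepsilon^{1/2}|\mathbb{T}_\varepsilon^{-1}(\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y})| &\le |(\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{-1/2}(\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y})| \le C_1\varepsilon^{1/2}|\mathbb{T}_\varepsilon^{-1}(\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y})|, \\
C_1^{-1}\varepsilon^{1/2}|\mathbb{T}_\varepsilon^{-1}(\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y})| &\le \varepsilon^{1/2}|\mathbb{T}_\varepsilon^{-1}(\tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y})| \le C_1\varepsilon^{1/2}|\mathbb{T}_\varepsilon^{-1}(\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y})|.
\end{aligned} \tag{A.5}$$

Write now, 

$$\begin{aligned}
|\Xi_\varepsilon^3(s,\mathbf{x})| &\le \int_{\mathbb{R}^{nd}} d\mathbf{y}\,\frac{|h(s,\mathbf{y})|}{(2\pi)^{nd/2}\det(\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{1/2}}\int_0^1 d\delta\,|(\Gamma_{s,\mathbf{x},\mathbf{y}}^{3})'(\delta)|, \\
\Gamma_{s,\mathbf{x},\mathbf{y}}^{3}(\delta) &:= \exp\Big(-\frac{1}{2}\Big\{\big\langle(\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{-1}(\tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y}), \tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y}\big\rangle \\
&\qquad + \delta\Big[\big\langle(\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{-1}(\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y}), \boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y}\big\rangle - \big\langle(\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{-1}(\tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y}), \tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y}\big\rangle\Big]\Big\}\Big), \\
|(\Gamma_{s,\mathbf{x},\mathbf{y}}^{3})'(\delta)| &\le \big|(\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{-1/2}\{(\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y}) + (\tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y})\}\big| \\
&\qquad \times \big|(\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{-1/2}\{(\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y}) - (\tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y})\}\big|\,\Gamma_{s,\mathbf{x},\mathbf{y}}^{3}(\delta),
\end{aligned} \tag{A.6}$$

using the Cauchy-Schwarz inequality for the last assertion. Equations (A.5) now yield that there exists $C_2 := 
C_2((\text{A-}\eta)) \ge 1$ s.t.: 

$$\begin{aligned}
|(\Gamma_{s,\mathbf{x},\mathbf{y}}^{3})'(\delta)| &\le C_2\big|\varepsilon^{1/2}\mathbb{T}_\varepsilon^{-1}(\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}))\big|\exp\big(-C_2^{-1}\varepsilon|\mathbb{T}_\varepsilon^{-1}(\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y})|^2\big) \\
&:= C_2|D_\varepsilon|\exp\big(-C_2^{-1}\varepsilon|\mathbb{T}_\varepsilon^{-1}(\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y})|^2\big).
\end{aligned} \tag{A.7}$$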

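Let us now recall the differential dynamics of $\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x})$, $\tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x})$, that is $\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) = \mathbf{x} + \int_s^{s+\varepsilon} du\,\mathbf{F}(u, \boldsymbol{\theta}_{u,s}(\mathbf{x}))$ 
and $\tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) = \mathbf{x} + \int_s^{s+\varepsilon} du\,\{\mathbf{F}(u, \boldsymbol{\theta}_{u,s+\varepsilon}(\mathbf{y})) + D\mathbf{F}(u, \boldsymbol{\theta}_{u,s+\varepsilon}(\mathbf{y}))(\tilde{\boldsymbol{\theta}}_{u,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \boldsymbol{\theta}_{u,s+\varepsilon}(\mathbf{y}))\}$. Set $\mathbf{F}^{s+\varepsilon,\mathbf{y}}(u,\mathbf{z}) := 
(F_1(u, \boldsymbol{\theta}_{u,s+\varepsilon}(\mathbf{y})), F_2(u, z_1, (\boldsymbol{\theta}_{u,s+\varepsilon}(\mathbf{y}))^{2,n}), F_3(u, z_2, (\boldsymbol{\theta}_{u,s+\varepsilon}(\mathbf{y}))^{3,n}), \dots, F_n(u, z_{n-1}, (\boldsymbol{\theta}_{u,s+\varepsilon}(\mathbf{y}))_n))^*$, $\forall (u,\mathbf{z}) \in 
[s, s+\varepsilon] \times \mathbb{R}^{nd}$. Observe in particular that $\mathbf{F}^{s+\varepsilon,\mathbf{y}}(u, \boldsymbol{\theta}_{u,s+\varepsilon}(\mathbf{y})) = \mathbf{F}(u, \boldsymbol{\theta}_{u,s+\varepsilon}(\mathbf{y}))$. We get: 

$$\begin{aligned}
D_\varepsilon = \varepsilon^{1/2}\mathbb{T}_\varepsilon^{-1}\int_s^{s+\varepsilon} du\,\Big\{&\big(\mathbf{F}(u, \boldsymbol{\theta}_{u,s}(\mathbf{x})) - \mathbf{F}^{s+\varepsilon,\mathbf{y}}(u, \boldsymbol{\theta}_{u,s}(\mathbf{x}))\big) + \big(D\mathbf{F}(u, \boldsymbol{\theta}_{u,s+\varepsilon}(\mathbf{y}))(\boldsymbol{\theta}_{u,s}(\mathbf{x}) - \tilde{\boldsymbol{\theta}}_{u,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}))\big) \\
&+ \int_0^1 d\delta\,\big(D\mathbf{F}^{s+\varepsilon,\mathbf{y}}(u, \boldsymbol{\theta}_{u,s+\varepsilon}(\mathbf{y}) + \delta(\boldsymbol{\theta}_{u,s}(\mathbf{x}) - \boldsymbol{\theta}_{u,s+\varepsilon}(\mathbf{y}))) - D\mathbf{F}^{s+\varepsilon,\mathbf{y}}(u, \boldsymbol{\theta}_{u,s+\varepsilon}(\mathbf{y}))\big)(\boldsymbol{\theta}_{u,s}(\mathbf{x}) - \boldsymbol{\theta}_{u,s+\varepsilon}(\mathbf{y}))\Big\} \\
:= D_\varepsilon^1 + D_\varepsilon^2 + D_\varepsilon^3,&
\end{aligned} \tag{A.8}$$

where for $(u,\mathbf{z}) \in [s, s+\varepsilon] \times \mathbb{R}^{nd}$, $D\mathbf{F}^{s+\varepsilon,\mathbf{y}}(u,\mathbf{z})$ is the $(nd) \times (nd)$ matrix with only non zero $d \times d$ matrix entries 
$(D\mathbf{F}^{s+\varepsilon,\mathbf{y}}(u,\mathbf{z}))_{j,j-1} := D_{x_{j-1}}F_j(u, z_{j-1}, \boldsymbol{\theta}_{u,s+\varepsilon}(\mathbf{y})^{j,n})$, $j \in [\![2,n]\!]$, so that in particular $D\mathbf{F}^{s+\varepsilon,\mathbf{y}}(u, \boldsymbol{\theta}_{u,s+\varepsilon}(\mathbf{y})) = 
D\mathbf{F}(u, \boldsymbol{\theta}_{u,s+\varepsilon}(\mathbf{y}))$. 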
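The structure of the "partial gradient" $D\mathbf{F}^{s+\varepsilon,\mathbf{y}}$ associated to the $\eta$-Hölder continuity of the mapping $x_{j-1} \in 
\mathbb{R}^d \mapsto D_{x_{j-1}}F_j(x_{j-1}, \mathbf{x}^{j,n})$, $\forall \mathbf{x}^{j,n} \in \mathbb{R}^{(n-j+1)d}$, yield that there exists $C_3 := C_3((\text{A-}\eta))$ s.t. for all $j \in [\![2,n]\!]$: 

$$\begin{aligned}
|(D_\varepsilon^3)_j| &\le C_3\varepsilon^{1/2-j}\int_s^{s+\varepsilon} du\,|(\boldsymbol{\theta}_{u,s}(\mathbf{x}) - \boldsymbol{\theta}_{u,s+\varepsilon}(\mathbf{y}))_{j-1}|^{1+\eta} \\
&\le C_3\varepsilon^{-1+\eta(j-3/2)}\int_s^{s+\varepsilon} du\,\big(\varepsilon^{1/2}|\mathbb{T}_\varepsilon^{-1}(\boldsymbol{\theta}_{u,s}(\mathbf{x}) - \boldsymbol{\theta}_{u,s+\varepsilon}(\mathbf{y}))|\big)^{1+\eta} \\
&\le C_3\varepsilon^{\eta(j-3/2)}\big(\varepsilon^{1/2}|\mathbb{T}_\varepsilon^{-1}(\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y})|\big)^{1+\eta},
\end{aligned} \tag{A.9}$$

up to a modification of $C_3$ and using the bi-Lipschitz property of the flow for the last inequality (see 
the end of the proof of Proposition 5.1 in [DM10] for details). 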

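On the other hand, the term $D_\varepsilon^1$ can be seen as a remainder w.r.t. the characteristic time scales. Precisely, 
there exists $C_4 := C_4((\text{A-}\eta))$ s.t. for all $j \in [\![1,n]\!]$: 

$$\begin{aligned}
|(D_\varepsilon^1)_j| &\le C_4\varepsilon^{1/2-j}\int_s^{s+\varepsilon} du\sum_{k=j}^{n}|(\boldsymbol{\theta}_{u,s}(\mathbf{x}) - \boldsymbol{\theta}_{u,s+\varepsilon}(\mathbf{y}))_k| \\
&\le C_4\int_s^{s+\varepsilon} du\,\big(\varepsilon^{1/2}|\mathbb{T}_\varepsilon^{-1}(\boldsymbol{\theta}_{u,s}(\mathbf{x}) - \boldsymbol{\theta}_{u,s+\varepsilon}(\mathbf{y}))|\big) \\
&\le C_4\varepsilon\big(\varepsilon^{1/2}|\mathbb{T}_\varepsilon^{-1}(\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y})|\big),
\end{aligned} \tag{A.10}$$

using once again the bi-Lipschitz property of the flow $\boldsymbol{\theta}$ for the last inequality. Recall now that $D_\varepsilon^2$ is the linear 
part of equation (A.8), i.e. it can be rewritten 

$$\begin{aligned}
D_\varepsilon^2 &= \int_s^{s+\varepsilon} du\,\big\{\varepsilon^{1/2}\mathbb{T}_\varepsilon^{-1}D\mathbf{F}(u, \boldsymbol{\theta}_{u,s+\varepsilon}(\mathbf{y}))(u-s)^{-1/2}\mathbb{T}_{u-s}\big\}\Big((u-s)^{1/2}\mathbb{T}_{u-s}^{-1}(\boldsymbol{\theta}_{u,s}(\mathbf{x}) - \tilde{\boldsymbol{\theta}}_{u,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}))\Big) \\
&:= \int_s^{s+\varepsilon} du\,\alpha_\varepsilon(u,s)\Big((u-s)^{1/2}\mathbb{T}_{u-s}^{-1}(\boldsymbol{\theta}_{u,s}(\mathbf{x}) - \tilde{\boldsymbol{\theta}}_{u,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}))\Big),
\end{aligned}$$

where there exists a constant $C := C((\text{A-}\eta))$ independent of $\varepsilon$ s.t. $\int_s^{s+\varepsilon} du\,|\alpha_\varepsilon(u,s)| \le C$. From (A.10), (A.9), 
(A.8) and Gronwall's Lemma we derive 

$$\exists C_5 := C_5((\text{A-}\eta)), \quad |D_\varepsilon| \le C_5\varepsilon^{\eta/2}\Big(\big(\varepsilon^{1/2}|\mathbb{T}_\varepsilon^{-1}(\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y})|\big)^{1+\eta} + 1\Big).$$

Plugging this estimate into (A.7), we then get from (A.6), using as well the good scaling property (A.2), that 
there exists $C_6 := C_6((\text{A-}\eta))$, 

$$\begin{aligned}
|\Xi_\varepsilon^3(s,\mathbf{x})| &\le C_6\varepsilon^{\eta/2}\int_{\mathbb{R}^{nd}} \frac{d\mathbf{y}}{\varepsilon^{n^2d/2}}\Big(\big(\varepsilon^{1/2}|\mathbb{T}_\varepsilon^{-1}(\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y})|\big)^{1+\eta} + 1\Big)\exp\big(-C_6^{-1}\varepsilon|\mathbb{T}_\varepsilon^{-1}(\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y})|^2\big) \\
&\le C_6\varepsilon^{\eta/2},
\end{aligned} \tag{A.11}$$

up to a modification of $C_6$ in the last inequality. 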
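Let us consider now $\Xi_\varepsilon^2(s,\mathbf{x})$. Write: 

$$\begin{aligned}
|\Xi_\varepsilon^2(s,\mathbf{x})| &\le C_7\int_{\mathbb{R}^{nd}} d\mathbf{y}\,\frac{|h(s,\mathbf{y})|}{(2\pi)^{nd/2}\det(\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{1/2}}\int_0^1 d\delta\,|(\Gamma_{s,\mathbf{x},\mathbf{y}}^{2})'(\delta)|, \quad C_7 := C_7((\text{A-}\eta)),\ \forall \delta \in [0,1], \\
\Gamma_{s,\mathbf{x},\mathbf{y}}^{2}(\delta) &= \exp\Big(-\frac{1}{2}\Big\{\big\langle(\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s+\varepsilon,\mathbf{y}})^{-1}(\tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y}), \tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y}\big\rangle \\
&\qquad + \delta\big\langle\big((\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{-1} - (\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s+\varepsilon,\mathbf{y}})^{-1}\big)(\tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y}), \tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y}\big\rangle\Big\}\Big), \\
|(\Gamma_{s,\mathbf{x},\mathbf{y}}^{2})'(\delta)| &\le \big|\big\langle\big((\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s+\varepsilon,\mathbf{y}})^{-1} - (\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{-1}\big)(\tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y}), \tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y}\big\rangle\big|\,\Gamma_{s,\mathbf{x},\mathbf{y}}^{2}(\delta).
\end{aligned} \tag{A.12}$$

Equations (A.5) (that according to (A.2) hold for $\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s+\varepsilon,\mathbf{y}}$ as well) yield: 

$$\begin{aligned}
|(\Gamma_{s,\mathbf{x},\mathbf{y}}^{2})'(\delta)| &\le C\big|\big\langle\big((\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s+\varepsilon,\mathbf{y}})^{-1} - (\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{-1}\big)(\tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y}), \tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y}\big\rangle\big|\exp\big(-C^{-1}\varepsilon|\mathbb{T}_\varepsilon^{-1}(\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y})|^2\big) \\
&:= C|Q_\varepsilon|\exp\big(-C^{-1}\varepsilon|\mathbb{T}_\varepsilon^{-1}(\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y})|^2\big),
\end{aligned} \tag{A.13}$$

for $C := C((\text{A-}\eta))$. From the scaling Lemma 3.6 in [DM10], we can write: 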

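$$\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s+\varepsilon,\mathbf{y}} = \varepsilon^{-1}\mathbb{T}_\varepsilon\hat{\mathbf{K}}_1^{s,s+\varepsilon,s+\varepsilon,\mathbf{y}}\mathbb{T}_\varepsilon, \qquad \tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}} = \varepsilon^{-1}\mathbb{T}_\varepsilon\hat{\mathbf{K}}_1^{s,s+\varepsilon,s,\mathbf{x}}\mathbb{T}_\varepsilon,$$

where $\hat{\mathbf{K}}_1^{s,s+\varepsilon,s+\varepsilon,\mathbf{y}}$, $\hat{\mathbf{K}}_1^{s,s+\varepsilon,s,\mathbf{x}}$ are uniformly elliptic and bounded matrices of $\mathbb{R}^{nd} \otimes \mathbb{R}^{nd}$. 

Now, the covariance matrices explicitly write 

$$\begin{aligned}
\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s+\varepsilon,\mathbf{y}} &= \int_s^{s+\varepsilon} du\,\tilde{\mathbf{R}}^{s+\varepsilon,\mathbf{y}}(s+\varepsilon,u)Ba(u, \boldsymbol{\theta}_{u,s+\varepsilon}(\mathbf{y}))B^*\tilde{\mathbf{R}}^{s+\varepsilon,\mathbf{y}}(s+\varepsilon,u)^*, \\
\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}} &= \int_s^{s+\varepsilon} du\,\tilde{\mathbf{R}}^{s,\mathbf{x}}(s+\varepsilon,u)Ba(u, \boldsymbol{\theta}_{u,s}(\mathbf{x}))B^*\tilde{\mathbf{R}}^{s,\mathbf{x}}(s+\varepsilon,u)^*,
\end{aligned}$$

where $\tilde{\mathbf{R}}^{s+\varepsilon,\mathbf{y}}$, $\tilde{\mathbf{R}}^{s,\mathbf{x}}$ respectively denote the resolvents associated to the linear parts of equations (2.4) and 
(A.1). Thus, our standing smoothness assumptions in (A-$\eta$) (i.e. $a$, $(D_{x_{i-1}}F_i)_{i \in [\![2,n]\!]}$ are supposed to be uniformly 
$\eta$-Hölder continuous) and the bi-Lipschitz property of the flow give: 

$$\begin{aligned}
&\big|\big\langle(\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s+\varepsilon,\mathbf{y}} - \tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})(\tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y}), \tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y}\big\rangle\big| \\
&\quad = \big|\big\langle(\hat{\mathbf{K}}_1^{s,s+\varepsilon,s+\varepsilon,\mathbf{y}} - \hat{\mathbf{K}}_1^{s,s+\varepsilon,s,\mathbf{x}})(\varepsilon^{-1/2}\mathbb{T}_\varepsilon(\tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y})), \varepsilon^{-1/2}\mathbb{T}_\varepsilon(\tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y})\big\rangle\big| \\
&\quad \le C|\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y}|^\eta\,\varepsilon^{-1}|\mathbb{T}_\varepsilon(\tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y})|^2.
\end{aligned}$$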



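Because of the non degeneracy of $a$, the inverse matrices $(\hat{\mathbf{K}}_1^{s,s+\varepsilon,s+\varepsilon,\mathbf{y}})^{-1}$, $(\hat{\mathbf{K}}_1^{s,s+\varepsilon,s,\mathbf{x}})^{-1}$ have the same Hölder 
regularity. Indeed, up to a change of coordinates one can assume that one of the two matrices is diagonal at 
the point considered and that the other has dominant diagonal if $|\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y}|$ is small enough (depending on 
the ellipticity bound and the dimension). This reduces to the scalar case. Hence, 

$$\begin{aligned}
|Q_\varepsilon| &= \big|\big\langle\big((\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s+\varepsilon,\mathbf{y}})^{-1} - (\tilde{\mathbf{K}}_{s,s+\varepsilon}^{s,\mathbf{x}})^{-1}\big)(\tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y}), \tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y}\big\rangle\big| \\
&= \big|\big\langle\big((\hat{\mathbf{K}}_1^{s,s+\varepsilon,s+\varepsilon,\mathbf{y}})^{-1} - (\hat{\mathbf{K}}_1^{s,s+\varepsilon,s,\mathbf{x}})^{-1}\big)(\varepsilon^{1/2}\mathbb{T}_\varepsilon^{-1}(\tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y})), \varepsilon^{1/2}\mathbb{T}_\varepsilon^{-1}(\tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y})\big\rangle\big| \\
&\le C|\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y}|^\eta\,\varepsilon|\mathbb{T}_\varepsilon^{-1}(\tilde{\boldsymbol{\theta}}_{s+\varepsilon,s}^{s+\varepsilon,\mathbf{y}}(\mathbf{x}) - \mathbf{y})|^2 \le C|\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y}|^\eta\,\varepsilon|\mathbb{T}_\varepsilon^{-1}(\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y})|^2,
\end{aligned}$$

for $C := C((\text{A-}\eta))$ using Lemma 2.1 and the bi-Lipschitz property of the flow $\boldsymbol{\theta}$ for the last inequality. From 
equations (A.13), (A.12) and (A.2), we eventually get: 

$$|\Xi_\varepsilon^2(s,\mathbf{x})| \le C\varepsilon^{\eta/2}\int_{\mathbb{R}^{nd}} \frac{d\mathbf{y}}{\varepsilon^{n^2d/2}}\big(\varepsilon^{1/2}|\mathbb{T}_\varepsilon^{-1}(\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y})|\big)^{2+\eta}\exp\big(-C^{-1}\varepsilon|\mathbb{T}_\varepsilon^{-1}(\boldsymbol{\theta}_{s+\varepsilon,s}(\mathbf{x}) - \mathbf{y})|^2\big) \le C\varepsilon^{\eta/2}, \tag{A.14}$$

for $C := C((\text{A-}\eta))$. Arguments similar to those employed for $\Xi_\varepsilon^2(s,\mathbf{x})$ can be used to prove $\Xi_\varepsilon^1(s,\mathbf{x}) \underset{\varepsilon \to 0}{\longrightarrow} 0$. The 
proof then follows from (A.4), (A.11), (A.14) recalling the original decomposition (A.3). □ 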